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TITLE OF THE INVENTION 

DIAGNOSIS. PROGNOSIS AND TREATMENT OF 
TRINUCLEOTIDE REPEAT-ASSOCIATED DISEASES AND INTRANUCLEAR 
INCLUSIONS-ASSOCIATED DISEASES 

FIELD OF THE INVENTION 

The present invention relates to neurodegenerative disorders. 
More particularly, the invention relates to Machado-Joseph disease (MJD). The 
present invention also relates to trinucleotide repeat expansions and more 
particularly to CAG repeats, also termed expansions of a coding CAG repeat 
(exp-CAG). and GCG repeats. More particularly, the invention relates to: (1) 
exp-CAG associated diseases; (2) intranuclear inclusions (INI) in patients and 
cellular models of exp-CAG associated diseases; and (3) the elucidation of the 
mechanism responsible for the toxic effects of such repeats. The present 
invention therefore relates to the diagnosis, prognosis and treatment of repeat- 
associated diseases and INI-associated diseases as well as to assays for the 
identification of agents which could be used for the treatment of such diseases or 
disorders. 

BACKGROUND OF THE INVENTION 

Coding CAG triplet repeat expansions cause several 
20 neurodegenerative disorders, including Machado-Joseph disease (MJD)^^. The 
presence of intranuclear filamentous inclusions (INI) containing expanded protein 
in MJD, as well as in other expanded CAG repeat disorders ^xp-CAG), have lead 
to a nuclear toxicity model ^- Similar INI are found in oculopharyngeal 
muscular dystrophy, which is caused by a short expansion of analantne encoding 
25 GCG repeat. According to the present invention, it is proposed that 
transcriptional ortranslationalframeshifts occurring within expanded CAG tracts 
result in the production and accumulation of polyalanin -containing mutant 
proteins. Th se alanine polymers might deposit in cells forming INI and I ad to 
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nuclear toxicity. Support for this disease model is provided using lymphoblast 
cells from MJD patients, as well as in pontine neurons of MJD brain and in in vitro 
cell culture models of the disease. Evidence that alanine polymers alone are 
toxic to cells is also provided and strongly suggests that a similar pathogenic 
5 mechanism underlies the other CAG repeat disorders. 

Indeed, recent reviews describe a significant number of 
neurodegenerative diseases, including Huntington disease as well as spinal 
cerebellar ataxias which are caused by CAG repeat expansions. Of note. CAG 
repeats code for polyglutamine in the protein containing same, it is commonly 

10 believed that these polyglutamine stretches in proteins are toxic to cells, and 
these repeats are also termed CAG/polyglutamine repeats (Iveret al. 1999, 
Nature Medicine 5:383-384). These diseases, also termed polyglutamine 
diseases, are thought to occur by "a gain function mechanism". Unfortunately, 
the mechanism explaining toxicity of the polyglutamine diseases, apparently 

15 through an aggregation in nuclear inclusions, has yet to be provided, although 
transgenic mice bearing a polyglutamine repeat in a recombinant protein were 
shown to display intranuclear polyglutamine inclusions (Hardy et al. 1998, 
Science 282:1076-1079). Of relevance, although the pathogenic effect of these 
inclusion bodies is not clearly understood, it is recognized that numerous types 

20 of genes can contain these so-called CAG repeats, and that while these repeats 
are linked to the disease, the genes containing these repeats are "largely 
irrelevant to the disease process" (Hardy et al. ipid., supra). 

INI are also found in oculopharyngeal muscular dystrophy 
(OPMD)^°, which is caused by short expansions of a polyalanlne (polyAla) 

25 encoding GCG tract in the PABP2 gene^^ (also see PCT/CA98/01133). In 
contrast to the CAG repeat disorders, where expansions frequently involve the 
addition of 20 or more codons, very small GCG expansions (exp-GCG) of 2 to 7 
additional codons are seen in dominant OPMD, suggesting thatpolyalanine tracts 
are prone to aggregation and may be very toxic^\ This contention is supported 
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by the observation thatpolyAla peptides containing more than 9 alanines (Ala) in 
a row form p-pleated sheet fibrillar macromolecules spontaneously in vitro^^, 
which in turn are extremely resistant to chemical and enzymatic degradation^^. 

Short trinucleotide repeat expansions causing a human 
5 disease have been first described in PCT application number PCT/CA98/0113, 
of Rouleau et al., which teaches that the addition of only two GCG repeats (which 
encode the amino acid alanine [ALA]) is sufficient to cause dominant OPMD. 
OPMD expansions do not share the cardinal features of "dynamic mutations". The 
GCG expansions are not only short they are also meiotically quite stable. 
10 Furthermore, there is a clear cut-off between the normal and abnormal alleles: a 
single GCG expansion causing a recessive phenotype. The PAB II (GCQ7 allele 
was thus the first example of a relatively frequent allele which can act as either 
a modifier of a dominant phenotype or as a recessive mutation. A dosage effect 
of these repeats is also disclosed in PCT/CA98/01 133, since a patient having an 
15 expansion in the poiyaianine tract of the HOXD13 protein (Akarsu,. et a!., 1996, 
Hum. Mol. Genet. 5: 945-952) has more severe deformities. A duplication causing 
a similar poiyaianine expansion in the subunit 1 gene of the core-binding 
transcription factor CBF(1) has also been found to cause dominant cleido-cranial 
dysplasia (Mundlos, S. et al.,1997, Cell 89:773-779). Of note, however, the 
20 mutations in these two rare diseases are not triplet-repeats. They are duplications 
of "cryptic repeats" composed of mixed synonymous codons and are thought to 
result from unequal crossing over (Warren, 1997, Science 275 : 408-409). In the 
case of OPMD, slippage during replication causing a reiteration of the GCG 
codon is a more likely mechanism (Wells. 1996, J. Biol. Chem.^: 2875-2878). 
25 Different observations converge to suggest that a gain of 

function of PAB II may cause the accumulation of nuclear filaments observed in 
OPMD (Tome et aL, 1980, Acta Neuropath. 49: 85-87). PAB II is found mostly in 
dimeric and oligomeric form (Nemeth. et al.,1995, Nud ic Acids R s.23: 4034- 
4041). It is possible that the poiyaianine tract plays a role in polymerization. 
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Polyalanine stretches have been found in many other nuclear proteins such as 
the HOX proteins, but their function is still unknown (Davies, et al.. 1997, Cell90: 
537-548). Alanine is a highly hydrophobic amino acid present in the cores of 
proteins, in dragline spider silk, polyalanine stretches are thought to form B-sheet 
5 structures important in ensuring the fibers' strength (Simmons, A.H. et al.. 
Science 271 :4 -87 (1996)). Polyalanine oligomers have also been shown to be 
extremely resistant to chemical denaturation and enzymatic degradation (Forood 
et al. 1995, Bioch. and Biophy. Res. Com. 211:7-13). Their role in the disease 
process, however, has still not been clearly identified. The more severe 

10 phenotypes observed in homozygotes for the (GCQ9 mutations and compound 
heterozygotes for the (GCG)9 mutation and (GCG)7 allele may con^spond to the 
fact that in these cases PAB II oligomers are composed only of mutated proteins. 
The ensuing faster filament accumulation could cause accelerated cell death.The 
recent description of nuclear filament inclusions in Huntington's disease, raises 

15 the possibility that "nuclear toxicity" caused by the accumulation of mutated 
homopolymeric domains is involved in the molecular pathophysiology of other 
triplet repeat diseases (Davies, S.W. et al., Cell 90: 537-548 (1997); Scherzinger, 
E. et al., Cell 90:549-558 (1997); DiFiglia. M. et al.. Science 277:1990-1993 
(1997)). Additional data, including immunocytochemical and expression studies 

20 will have to be provided to test this pathophysiological hypothesis and provide 
some insight into why certain muscle groups are more affected, while ail tissues 
express PAB II. 

There thus remains a need to elucidate the mechanism by 
which GCG and CAG expansions are toxic to cells. There also remains a need 
25 to provide diagnosis and/or prognosis and/or treatment tools for diseases 
associated with GCG or CAG repeats. 

The present invention seeks to meet these and other needs. 

The present description refers to a number of documents, the 
content of which is herein incorporated by refer nee in their entirety. 
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SUMMARY OF THE INVENTION 

The invention therefore concerns the identification of the 
mechanistic action of expanded CAG tracts in cell pathogenesis, cell death or 
disease. More specifically, the invention relates to the identification that 
5 mutational events within these CAG enable them to encode poly alanine stretches 
which accumulate in intranuclear filamentous inclusions and somehow trigger 
toxicity in cells. In particular, the present invention relates totranslational and/or 
transcriptional frameshift events occurring within the CAG tracts, thereby resulting 
in the production and accumulation of polyalanine-containing mutant proteins, 
10 The present invention, in addition, relates to a formal 

identification of the toxic effects of polyalanine stretches on nuclear toxicity. The 
Applicant is thus the first to have demonstrated that CAG expansions can give 
rise to production of mutant proteins containing polyalanine stretches. The 
Applicant is also the first to have demonstrated that polyalanine stretches in 
15 proteins are indeed toxic to cells. 

The instant invention also relates to gene replacement 
technologies aimed at deleting a repeat-containing protein giving rise to a mutant 
protein by a normal corresponding protein (i.e. lacking the repeat or having a 
smaller repeat). 

20 The present invention also provides the means to determine 

a predisposition to developing a disease or condition associated with the 
expression of a polyamino acid-containing protein such as polyalanine-containing 
proteins. This detemiination could thus enable a better prognosis of the disease 
and condition and enable a determination of the t>est treatment or prevention of 

25 the disease or condition. 

Another aim of the present invention is thus to provide means 
to screen humans (and more broadly animals) to identify those that might have 
a predisposition to developing a dis ase associated with the expression of genes 
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genes. 

It is thus an aim of the present invention to provide the means 
to better manage such disease prevention and intervention programs. 
5 Before the present invention, CAG repeat expansions (also 

known as CAG repeats or CAG/polyglutamine stretches) were recognized as a 
common denominator in numerous neurological diseases and were thought to 
code for polyglutamine stretches in the mutant protein. These polyglutamine 
stretches were thought to confer "a gain of toxic property to these proteins" 

10 (Iver et al. supra) by a mechanism that was not understood (Hardy et at. supr$. 
Indeed, CAG tract toxicity was also referred to as polyglutamine diseases. The 
present invention demonstrates that these CAG repeats actually encode 
polyalanine stretches. 

Prior to the present invention, the identification of CAG 

15 repeats in a gene correlating with a neurological disease led to the classification 
of such disease in the polyglutamine diseases (Hardy et al. supra). The present 
invention now demonstrates that polyalanine stretches, as opposed to 
polyglutamine stretches, are responsible for the diseases. 

In view of the above, the present invention opens the way to 

20 numerous methods of diagnosing, prognosing or treating CAG repeat-dependent 
diseases or conditions. In addition, it provides means to diagnose, prognose and 
treat diseases or conditions associated with polyalanine-containing proteins (i.e. 
GCG repeats). Non-limiting examples thereof comprise methods using ligands 
(i.e. polyclonal and monoclonal antibodies), nucleic acid sequences, restriction 

25 length polymorphisms (RFLPs) and the like. 

While the instant invention is more particularly directed to 
neurological diseases, as is demonstrated with Machado-Joseph disease, it 
should be understood that the present invention should not be so limited, indeed, 
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polyalanine stretches have been shown to be responsible for non-neurological 
diseases such as, for example, a muscle disease (OPMD; PCT/CA98/01133). 

In order to better understand the disease process associated 
with the presence of INI inexp-CAG and exp-GCG associated disorders, a direct 
5 assessment of whether polyAla stretches accumulated as intranuclear protein 
aggregates was carried out. Experiments were thus performed to analyze 
whether rare transcriptional or translational frameshifts in targe CAG stretches 
resulting in new reading frames with the formation of a hybrid protein containing 
a mixed polyglutamine/poly-alanine tract occurred. Additionally, it was of 

1 0 importance to assess whether the resultant polyAla peptides accumulate in nuclei 
where they form INI. 

Surprisingly, it was discovered that an antiserum raised 
against the hypothetical COOH terminus of the predicted polyAla containing 
frameshifted ataxin-3 protein (MJD-Ala) detects the frameshifted species in 

15 lymphoblastoid cells from MJD patients with large CAG tracts. This antiserum 
detects these polyAla tracts as insoluble macromolecules on Westem blots, and 
as intranuclear inclusions by immunocytochemistry. Frameshifted species were 
also present in INI in pontine neurons of MJD brain. Transfection of COS-7 cells 
with fulNength MJD-1 fused to the enhanced green fluorescence protein (EGFF) 

20 gene in the aitemative polyAla reading frame also leads to EGFP accumulation 
preferentially when the CAG tract is expanded. 

Of interest, it is also demonstrated that long CAG repeats are 
prone to frameshifts, which result in accumulation of the predicted polyAla- 
containing inclusions. Transfection of COS-7 cells with mutated MJD-1 

25 constructs containing alanine-coding GCA stretches results in a more severe 
phenotype when compared to their CAG counterparts. Furthermore. transfected 
polyAla-encoding GCA stretches alone are toxic and form aggregates. 

How these accumulations lead to cell death still needs to be 

elucidated. 
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A frameshift error occurring within a CAG tract thus results in 
the alternate alanine-encoding GCA frame. Many authors have reported 
frameshifts at the level of transcription or translation. The observation of 
transcriptional errors of the p-amyloid precursor protein and ubiquitin-B in 
5 Alzheimer's disease^^, and apoB86^^, supports the existence of such errors, and 
their role in disease pathogenesis. Translational errors have also been shown to 
occur and may be the basis for the formation of frameshifted proteins^®. 

In accordance with one embodiment of the present invention, 
there is therefore provided a method for the diagnosis of a disease associated 
10 with protein accumulation in intranuclear inclusions, which comprises obtaining 
a sample of a patient and determining a presence of the protein accumulation in 
the intranuclear inclusions, wherein this protein accumulation is indicative of a 
disease related thereto. 

In accordance with another embodiment of the present 
1 5 invention, there is also provided a method for the screening of agents which can 
modulate at least one of (1) a polyamino acid-containing protein expression; (2) 
accumulation of polyamino acid-containing proteins in intranuclear inclusions; and 
(3) toxicity to cells, which comprises: a) incubating a cell harboring an expression 
vector of the present invention, comprising a repeat domain which can give rise 
20 to a polyamino acid-containing protein associated with a disease or condition in 
an animal, with a compound; and b) assessing one of (1) an expression of the 
polyamino acid-containing protein; (2) accumulation of the polyamino acid- 
containing protein; and (3) toxicity to cells; whereby a modulator is selected when 
the agent significantly modulates one of the expression, accumulation and 
25 toxicity, as compared to a control agent. 

The instant invention also relates to GCG repeats encoding 
polyalanine stretches and their association with protein accumulation in a cell 
nucleus, swallowing difficulty and/or ptosis in a patient. In accordance with 
another embodiment of the present invention, there is provided a method for the 
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diagnosis or prognosis of a disease associated with protein accumulation in a cell 
nucleus, and/or swallowing difficulty and/or ptosis in a human patient, which 
comprises: 

a) obtaining a sample of a patient; and 

5 b) dete*' raining the extract of the polyalanine stretch in an alanine 

stretch-containing protein having the amino acid sequence: 

Met(Ala)6«Ala, 
wherein n is selected from 0 to 7, and 

whereby an n equal to 1 to 7 is indicative of a disease related 
1 0 with the protein accumulation in the nucleus, and/or a swallowing difficulty and/or 
ptosis in the patient. In a related aspect of the present invention, there is provided 
a human PAB II protein comprising a polymorphic GCG repeat encoding a 
polyalanine stretch having the sequence 
Met(Ala)6*„Ala, 

15 wherein n is 0. and wherein the sequence is indicative of a non*disease 
phenotype associated with protein accumulation in a cell nucleus, swallowing 
difficulty, and/or ptosis in a human patient. 

In order to provide a clear and consistent understanding of 
terms used in the present description, a number of definitions are provided 

20 hereinbelow. 

Nucleotide sequences are presented herein by single strand, 
in the 5* to 3' direction, from left to right, using the one letter nucleotide symbols 
as commonly used in the art and in accordance with the recommendations of the 
lUPAC-lUB Biochemical Nomenclature Commission. 
25 Unless defined otherwise, the scientific and technological 

terms and nomenclature used herein have the same meaning as commonly 
understood by a person of ordinary skill to which this invention pertains. 
Generally, th procedures for cell cultures, infection, mol cular biology m thods 
and the lik are common methods used in the art. Such standard t chniqu s can 
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be found in reference manuals such as for example Sambrook et al. (1989, 
Molecular Cloning - A Laboratory Manual. Cold Spring Harbor Laboratories) and 
Ausubel et al. (1994, Current Protocols in Molecular Biology. Wiley. New York). 

The present description refers to a number of routinely used 
5 recombinant DNA (rDNA) technology temris. Nevertheless, definitions of selected 
examples of such rDNA terms are provided for clarity and consistency. 

As used herein, "nucleic acid molecule", refers to a polymer of 
nucleotides. Non-limiting examples thereof include DNA (i.e. genomic DNA, 
cDNA) and RNA molecules (i.e. mRNA). The nucleic acid molecule can be 
10 obtained by cloning techniques or synthesized. DNA can be double-stranded or 
single-stranded (coding strand or non-coding strand [antisense]). 

The term "recombinant DNA" as known in the art refers to a 
DNA molecule resulting from the joining of DNA segments. This is often referred 
to as genetic engineering. 
15 The term "DNA segment", is used herein, to refer to a DNA 

molecule comprising a linear stretch or sequence of nucleotides. This sequence 
when read in accordance with the genetic code, can encode a linear stretch or 
sequence of amino acids which can be referred to as a polypeptide, protein, 
protein fragment and the like. 
20 The terminology "amplification pair" refers herein to a pair of 

oligonucleotides (oligos) of the present invention, which are selected to be used 
together in amplifying a selected nucleic acid sequence by one of a number of 
types of amplification processes, preferably a polymerase chain reaction. Other 
types of amplification processes include ligase chain reaction, strand 
25 displacement amplification, or nucleic acid sequence-based amplification, as 
explained in greater detail below. As commonly known in the art. theoligos are 
designed to bind to a complementary sequence under selected conditions. 

The nucleic acid (i.e. DNA or RNA) for practicing the present 
invention may be obtained according to well known methods. 
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As used herein, the term "physiologically relevant" is meant to 
describe a frameshifting event which can result in the production of a toxic protein 
in vivo. 

Oligonucleotide probes or primers of the present invention 
5 may be of any suitable length, depending on the particular assay format and the 
particular needs and targeted sequences employed. In general, the 
oligonucleotide probes or primers are at least 12 nucleotides in length, preferably 
between 15 and 24 molecules, and they may be adapted to be especially suited 
to a chosen nucleic add amplification system. As commonly known in the art, the 
10 oligonucleotide probes and primers can be designed by taking into consideration 
the melting point of hydrizidation thereof with its targeted sequence (see below 
and in Sambrook et al., 1989, Molecular Cloning - A Laboratory Manual, 2nd 
Edition, CSH Laboratories; Ausubel et aL, 1989, in Cunrent Protocols in Molecular 
Biology, John Wiley & Sons Inc., N.Y.). 
15 The terms "oligonucleotide" or "DNA" molecule or sequence 

refers to a molecule comprised of the deoxy ribonucleotides adenine (A), guanine 
(G). thymine (T) and/or cytoslne (C). in a double-stranded or single-stranded 
form. The term "oligonucleotide" or "DNA" can be found in linear DMA molecules 
or fragments, viruses, plasmids, vectors, chromosomes or synthetically derived 
20 DNA. As used herein, particular DNA sequences may be described according to 
the normal convention of giving only the sequence in the 5* to 3* direction. 

"Nucleic acid hybridization" refers generally to the 
hybridization of two single-stranded nucleic acid molecules having 
complementary base sequences, which under appropriate conditions will form a 
25 thermodynamically favored double-stranded structure. Examples of hybridization 
conditions can be found in the two laboratory manuals referred above ^ambrook 
et al., 1989, supra and Ausubel et al., 1989, supra) and are commonly known in 
the art. In the case of a hybridization to a nitrocellulose filter, as for exampi in th 
well known Southern blotting procedur , a nitrocellulose filter can be incubat d 
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overnight at SSX with a labeled probe in a solution containing 50%formamide, 
high salt (5 x SSC or 5 x SSPE), 5 xDenhardt's solution, 1% SDS, and 100 \ig/m\ 
denatured carrier DNA (i.e. salmon sperm DNA). The non-specifically binding 
probe can then be washed off the filter by several washes in 0.2 x SSC/0.1% SDS 
5 at a temperature which is selected in view of the desired stringency: room 
temperature (low stringency), 42X (moderate stringency) or 65*C (high 
stringency). The selected temperature is based on the melting temperature (Tm) 
of the DNA hybrid (Sambrook et al. 1989, supra). Of course, RNA-DNA hybrids 
can also be formed and detected. In such cases, the conditions of hybridization 
10 and washing can be adapted according to well known methods by the person of 
ordinary skill. Stringent conditions will be preferably used (Sambrook et al.,1989, 
supra). 

Probes or primers of the invention can be utilized with 
naturally occurring sugar-phosphate backbones as well as modified backbones 

15 including phosphorothioates, dithionates, alky! phosphonates and a-nucieottdes 
and the like. Modified sugar-phosphate backbones are generally taught by Miller, 
1988, Ann. Reports Med. Chem. 23:295 and Moran et al., 1987, Nucleic acid 
molecule. Acids Res., 14:5019. Probes or primers of the invention can be 
constructed of either ribonucleic acid (RNA) or deoxyribonucleic acid (DNA), and 

20 preferably of DNA. 

The types of detection methods in which probes can be used 
include Southern blots (DNA detection), dot or slot blots (DNA, RNA), and 
Northem blots (RNA detection). Although less preferred, labeled proteins could 
also be used to detect a particular nucleic acid sequence to which it binds. Other 
25 detection methods include kits containing probes on a dipstick setup and the like. 

Although the present invention is not specifically dependent 
on the use of a label for the detection of a particular nucleic acid sequence, such 
a label might be beneficial, by incr asing the sensitivity of the detection. 
Furthermore, it enables automation (the sam can also be said of detection of 
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proteins using ligands such as antibodies). Probes can be labeled according to 
numerous well known methods (Sambrook et aL, 1989. supra). Non-limiting 
examples of detectable markers include ligands. fluorophores, chemiluminescent 
agents, enzymes, and antibodies. Other detectable markers for use with probes, 
5 which can enable an increase in sensitivity of the method of the invention, include 
biotin and radionucleotides. It will become evident to the person of ordinary skill 
that the choice of a particular label dictates the manner in which it is bound to the 
probe. 

As commonly known, radioactive nucleotides can be 
10 incorporated into probes of the invention by several methods. Non-limiting 
examples thereof include kinasing the 5* ends of the probes using gamma 
ATP and polynucleotide kinase, using the Klenow fragment of Pol I of E. coli in 
the presence of radioactive dNTP (i.e. uniformly labeled DNA probe using random 
oligonucleotide primers in low-melt gels), using the SP6/T7 system to transcribe 
15 a DNA segment in the presence of one or more radioactive NTP, and the like. 

As used herein, "oligonucleotides" or "oligos" define a 
molecule having two or more nucleotides (ribo or deoxyribonucleotides). The size 
of the oligo will be dictated by the particular situation and ultimately on the 
particular use thereof and adapted accordingly by the person of ordinary skill. An 
20 oligonucleotide can be synthetised chemically or derived by cloning according to 
well known methods. 

As used herein, a "primer^ defines an oligonucleotide which is 
capable of annealing to a target sequence, thereby creating a double stranded 
region which can serve as an initiation point for DNA synthesis under suitable 
25 conditions. 

Amplification of a selected, or target, nucleic acid sequence 
may be carried out by a number of suitable methods. See generally Kwoh et al., 
1990, Am. Biotechnol. Lab. 8:14-25. Numerous amplification techniques have 
be n described and can be r adily adapted to suit particular n ds of a person 
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of ordinary skill. Non-limiting examples of amplification techniques include 
polymerase chain reaction (PCR), ligase chain reaction (LCR), strand 
displacement amplification (SDA), transcription-based amplification, the Qp 
replicase system and NASBA (Kwoh et al., 1989. Proc. Natl. Acad.Sci. USA 86, 
5 1 173-1 177; Lizardi et al., 1988. BioTechnology 6:1 197-1202; Maiek et aL, 1994, 
Methods Mol. BioL, 28:253-260; and Sambrook et al.. 1989. supra). Preferably, 
amplification will be candied out using PCR. 

Polymerase chain reaction (PCR) is earned out in accordance 
with known techniques. See. e.g., U.S. Pat. Nos. 4,683.195; 4.683,202; 
10 4.800.159; and 4.965.188 (the disclosures of all three U.S. Patent are 
incorporated herein by reference). In general. PCR involves, a treatment of a 
nucleic acid sample (e.g.. in the presence of a heat stable DNA polymerase) 
under hybridizing conditions, with one oligonucleotide primer for each strand of 
the specific sequence to be detected. An extension product of each primer which 
15 is synthesized is complementary to each of the two nucleic acid strands, with the 
primers sufficiently complementary to each strand of the specific sequence to 
hybridize therewith. The extension product synthesized from each primer can also 
serve as a template for further synthesis of extension products using the same 
primers. Following a sufficient number of rounds of synthesis of extension 
20 products, the sample is analysed to assess whether the sequence or sequences 
to be detected are present. Detection of the amplified sequence may be carried 
out by visualization following EtBr staining of the DNA following gel electrophores. 
or using a detectable label in accordance with known techniques, and the like. For 
a review on PCR techniques (see PCR Protocols, A Guide to Methods and 
25 Amplifications. Michael et al. Eds. Acad. Press, 1990). 

Ligase chain reaction (LCR) is carried out in accordance with 
known techniques (Weiss, 1991, Science 254:1292). Adaptation of the protocol 
to meet the desired needs can be canied out by a person of ordinary skill. Strand 
displacement amplification (SDA) is also carried out in accordance with known 
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techniques or adaptations thereof to meet the particular needs (Walker et al., 
1992, Proc. Natl. Acad. Sci. USA 89:392-396; and Ibid., 1992, Nucleic Adds Res. 
20:1691-1696). 

As used herein, the temn "gene" is well known in the art and 
5 relates to a nucleic acid sequence defining a single protein or polypeptide. A 
''structural gene" defines a DNA sequence which is transcribed into RNA and 
translated into a protein having a specific amino acid sequence thereby giving rise 
the a specific polypeptide or protein. It will be readily recognized by the person of 
ordinary skill, that the nucleic acid sequence of the present invention can be 
10 incorporated into anyone of numerous established kit formats which are well 
known in the art. 

A "heterologous" (i.e. a heterologous gene) region of a DNA 
molecule is a subsegment segment of DNA within a larger segment that is not 
found in association therewith in nature. The tenn tieterologous" can be similarly 
15 used to define two polypeptidic segments not joined together in nature. Non- 
limiting examples of heterologous genes include reporter genes such as green 
fluorescence protein, luciferase, chloramphenicol acetyl transferase, p- 
galactosidase, and the like which can be juxtaposed or joined to heterologous 
control regions or to heterologous polypeptides. 
20 The term "vector^ is commonly known In the art and defines a 

plasmid DNA, phage DNA, viral DNA and the like, which can serve as a DNA 
vehicle into which DNA of the present invention can be cloned. Numerous types 
of vectors exist and are well known in the art. 

The term "expression" defines the process by which a gene is 
25 transcribed into mRNA (transcription), the mRNA is then being translated 
(translation) into one polypeptide (or protein) or more. 

The terminology "expression vector" defines a vector or 
vehicle as described above but designed to enable the expression of an inserted 
sequence following transformation into a host. The cloned gen (inserted 
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sequence) is usually placed under the control of control element sequences such 
as promoter sequences. The placing of a cloned gene under such control 
sequences is often refered to as being operably linked to control elements or 
sequences. 

" Operably linked sequences may also include two segments 

"^at are transcribed onto the same RNA transcript. Thus, two sequences, such 
promoter and a "reporter sequence" are operably linked if transcription 
^^j.nmencing in the promoter will produce an RNA transcript of the reporter 
sequence. In order to be "operably linked" it is not necessary that two sequences 
10 be immediately adjacent to one another. 

Expression control sequences will vary depending on whether 
the vector is designed to express the operably linked gene in a prokaryotic or 
eukaryotic host or both (shuttle vectors) and can additionally contain 
transcriptional elements such as enhancer elements, termination sequences, 
15 tissue-specificity elements, and/or translational initiation and termination sites. 

Prokaryotic expressions are useful for the preparation of large 
quantities of the protein encoded by the DNA sequence of interest. This protein 
can be purified according to standard protocols that take advantage of the 
intrinsic properties thereof, such as size and charge (i.e. SDS gel electrophoresis. 
20 gel filtration, centrifugation, ion exchange chromatography...). In addition, the 
protein of interest can be purified via affinity chromatography using polyclonal or 
monoclonal antibodies. The purified protein can be used for diagnostic or 
therapeutic applications. 

The DNA construct can be a vector comprising a promoter 
25 that is operably linked to an oligonucleotide sequence of the present invention, 
which is in turn, operably linked to a heterologous gene, such as the gene for the 
luclferase reporter molecule. "Promoter" refers to a DNA regulatory region 
capable of binding directly or indirectly to RNA polymerase in a cell and initiating 
transcription of a downstream (3* direction) coding sequence. For purposes of 
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the present invention, the promoter is bound at its 3' terminus by the transcription 
initiation site and extends upstream (5' direction) to include the minimum number 
of bases or elements necessary to initiate transcription at levels detectable above 
background. Within the promoter will be found a transcription initiation site 
5 (conveniently defined by mapping with S1 nuclease), as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA polymerase. 
Eukaryotic promoters will often, but not always, contain "TATA" boses and 
"CCAT* boxes. Prokaryotic promoters contain Shine-Dalgamo sequences in 
addition to the -10 and *35 consensus sequences. 
10 As used herein, the designation "functional derivative" 

denotes, in the context of a functional derivative of a sequence whether an 
nucleic acid or amino acid sequence, a molecule that retains a biological activity 
(either function or structural) that is substantially similar to that of the original 
sequence. This functional derivative or equivalent may be a natural derivative or 
15 may be prepared synthetically. Such derivatives include amino acid sequences 
having substitutions, deletions, or additions of one or more amino acids, provided 
that the biological activity of the protein is conserved. The same applies to 
derivatives of nucleic acid sequences which can have substitutions, deletions, or 
additions of one or more nucleotides, provided that the biological activity of the 
20 sequence is generally maintained. When relating to a protein sequence, the 
substituting amino acid as chemico-physical properties which are similar to that 
of the substituted amino acid. The similar chemico-physical properties include, 
similarities in charge, bulkiness, hydrophobicity, hydrophylicity and the like. The 
term "functional derivatives" is intended to include "fragments", "segments'*. 
25 "variants", "analogs" or "chemical derivatives" of the subject matter of the present 
invention. 

Thus, the tenn "variant" refers herein to a protein or nucleic 
acid molecule which is substantially similar in structure and biological activity to 
th protein or nucleic acid of the present invention. 



wo 01/18544 




PCT/CAOO/01052 



18 



The functional derivatives of the present invention can be 
synthesized chemically or produced through recombinant DNA technology. All 
these methods are well known in the art. 

As used herein, "chemical derivatives" is meant to cover 
5 additional chemical moieties not normally part of the subject matter of the 
invention. Such moieties could affect the physico-chemical characteristic of the 
derivative (i.e. solubility, absorption, half life and the like, decrease of toxicity). 
Such moieties are examplified in Remington's Pharmaceutical Sciences (1980). 
Methods of coupling these chemicai-physical moieties to a polypeptide are well 
10 known in the art. 

The term "allele" defines an alternative form of a gene which 
occupies a given locus on a chromosome. 

As commonly known, a "mutation" is a detectable change in 
the genetic material which can be transmitted to a daughter cell. As well known, 
15 a mutation can be, for example, a detectable change in one or more 
deoxyribonucleotide. For example, nucleotides can be added, deleted, substituted 
for, inverted, or transposed to a new position. Spontaneous mutations and 
experimentally induced mutations exist. A mutant polypeptide can be encoded 
from a mutant nucleic acid molecule. In addition, mutant proteins can be 
20 produced through aberrant events during replication, transcription and/or 
translation. Frameshifting (the switching from a particular reading frame to 
another) is such a mechanism that can modify the sequence of the translated 
protein. 

As used herein, the term "purified" refers to a molecule having 
25 been separated from a cellular component. Thus, for example, a "purified protein" 
has been purified to a level not found in nature. A "substantially pure" molecule 
is a molecule that is lacking in all other cellular components. 

As used h rein, the terms "molecule", "compound", "agent", or 
"ligand" are us d interchang ably and broadly to refer to natural, synth tic or 
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semi-synthetic molecules or compounds. The temi "molecule" therefore denotes 
for example chemicals, macromolecuies, cell or tissue extracts (from plants or 
animals) and the like. Non limiting examples of molecules include nucleic acid 
molecules, peptides, antibodies, carbohydrates and phamiaceuticaUherapeutical 
5 agents. The agents can be selected and screened by a variety of means 
including random screening, rational selection and by rational design using for 
example protein oriigand modelling methods such as computer modelling. The 
terms "rationally selected" or "rationally designed" are meant to define, for 
example, compounds which have been chosen based on the configuration of the 
1 0 polyalanine domains of the present invention. As will be understood by the 
person of ordinary skill, macromolecuies having non-naturally occurring 
modifications are also within the scope of the term "molecule". For example, 
peptidomimetics, well known in the pharmaceutical industry and generally 
referred to as peptide analogs can be generated by modelling as mentioned 
1 5 above. Similarly, in one embodiment, the polypeptides of the present invention 
can be modified to enhance or decrease their stability. It should be understood 
that in most cases this modification should not alter the biological activity of the 
polyalanine domain (its toxic effect or INI localization property). The molecules 
identified in accordance with the teachings of the present invention have a 
20 therapeutic value in diseases or conditions in which the physiology or 
homeastasis of the cell and/or tissue is compromised by a production of 
polyalanine-containing proteins or polypeptides. Altematively. the molecules 
identified in accordance with the teachings of the present invention find utility in 
the development of more efficient molecule to lower and/or abrogate the toxicity 
25 of such proteins and/or to reduce or eliminate the production of such mutant 
proteins. It will be understood that agents can be screened, in accordance with 
the present invention, with libraries of compounds, using for example automated 
scr ening methods (e.g. array technologies). 
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The level of gene expression of the reporter gene (e.g. the 
level of luciferase, or p-gal, produced) within the treated cells can be compared 
to that of the reporter gene in the absence of the molecules(s). The difference 
between the levels of gene expression indicates whether the molecule(s) of 
5 interest agonizes the aforementioned interaction. The magnitude of the level of 
reporter gene product expressed (treated vs. untreated cells) provides a relative 
indication of the strength of that molecule(s) as an agonist. The same type of 
approach can also be used in the presence of an antagonist(s). Thus, 
modulators of the production of such proteins can be identified and selected. 
10 Non-limiting examples of modulators in accordance with the present invention 
include frameshift mutants or suppressors, relievers of codon rareness (I.e. 
relieving the limitation of rarecodons which favor frameshifting events), agents 
which degrade polyalanine stretches, tRNAs. tRNA suppressors and the like. 
One skilled in the art will realize that the assays to identify compounds that 
15 modulate frameshifting of CAG repeats and the like, could be canied out using 
other repeats as well as other genes known to promote frameshifting. Such 
genes are known in the art. 

The present invention also provides antisense nucleic acid 
molecules which can be used for example to modulate the expression of the 
20 mutant proteins of the present invention. An antisense nucleic add molecule 
according to the present invention refers to a molecule capable of forming a 
stable duplex or triplex with a portion of its targeted nucleic add sequence (DMA 
or RNA). The use of antisense nucleic acid molecules and the design and 
modification of such molecules is well known in the art as described for example 
25 in WO 96/32966, WO 96/11266, WO 94/15646, WO 93/08845 and 
USP 5,593,974, Antisense nucleic acid molecules according to the present 
invention can be derived from the nucleic acid sequences and modified in 
accordance to well known methods. For example, some antisense molecules can 
be d signed to be more resistant to degradation to increase their affinity to their 
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targeted sequence, to affect their transport to chosen cell types or ceil 
compartments, and/or to enhance their lipid solubility bu using nucleotide analogs 
and/or substituting chosen chemical fragments thereof, as commonly known in 
the art. 

5 Alternatively, an indicator cell in accordance with the present 

invention can be used to identify antagonists. For example, the test molecule or 
molecules are incubated with the host cell in conjunction with one or more 
agonists held at a fixed concentration. An indication and relative strength of the 
antagonistic properties of the molecule(s) can be provided by comparing the level 

10 of gene expression in the indicator cell in the presence of the agonist, in the 
absence of test molecules vs in the presence thereof. Of course, the antagonistic 
effect of a molecule can also be determined in the absence of agonist simply by 
comparing the level of expression of the reporter gene product in the presence 
and absence of the test molecule(s). 

15 It shall be understood that the Tfn vivo" experimental model 

can also be used to carry out an 7n vtfro'' assay. For example, cellular extracts 
from the indicator cells can be prepared and used in one of the aforementioned 
"/n wfro" tests. 

As used herein the recitation "indicator cells" refers to. for 
20 example, cells that express a fusion protein comprising apolyalanine segment 
(e.g. a "CAG" repeat) and an identifiable or selectable phenotype or characteristic 
which enables an assessment of the level of fusion protein expression (e.g. a 
reporter protein). Such indicator cells can be used in the screening assays of the 
present invention. In certain embodiments, the indicator cells have been 
25 engineered so as to express a chosen derivative, fragment, homolog, or mutant 
of a repeat. It should be understood that the repeats should not be limited to CAG 
repeats. Indeed, GCG repeats can also be used. In addition, the invention should 
not be limited to polyalanine repeats, since the present invention provides for th 
testing of polyserine fragments and other polyamino acids which could b 
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expressed from a frameshifting event. The cells can be yeast cells or higher 
eukaryotic cells such as mammalian cells (WO 96/41169). Preferably, the 
indicator cells are higher eukaryotic cells. Non-limiting examples of such ceils 
and vectors are exemplified herein below (i.e. Examples 2-4). In one particular 
5 embodiment, the indicator cell could be used to test a compound or a library 
thereof. 

As exemplified herein below in one embodiment, a 
polyalanine polypeptide segment of the present invention is provided as a fusion 
protein. The design of constructs therefor and the expression and production of 

10 fusion proteins are exemplified herein and are well known in the art (Sambrook 
et al., 1989, supra; and Ausubel et aL, 1994. supra), 

Non limiting examples of such fusion proteins include a 
hemaglutinin fusions (HA) and Gluthione-S-transferase (GST) ftjsions. In certain 
embodiments, it might be beneficial to introduce a protease cleavage site 

1 5 between the two polypeptide sequences which have been fused. Such protease 
cleavage sites between two heterologously fused polypeptides are well known in 
the art. 

In certain embodiments, it might also be beneficial to 
introduce a linker (commonly known) between the repeat segment of the protein 

20 and the heterologous polypeptide portion (e.g. reporter protein portion). Such 
fusion protein find utility in the assays of the present invention as well as for 
purification purposes, detection purposes and the like. 

For certainty, the sequences and polypeptides useful to 
practice the invention include without being limited thereto mutants, homologs. 

25 subtypes, alleles and the like. It shall be understood that generally, the 
sequences of the present invention should encode a functional (albeit defective) 
repeat domain. It will be clear to the person of ordinary skill that whether a repeat 
domain of the present invention, variant, derivative, or fragm nt thereof r tains 
its function In nabling a concentration of the protein containing same in INI or 
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triggering toxicity in cells or animals, can be readily determined by using the 
teachings and assays of the present invention and the general teachings of the 
art. 

As exemplified herein below, the repeat domains of the 
5 present invention can be modified, for example by /n vitro mutagenesis, to dissect 
the structure-function relationship thereof and permit a better design and 
identification of modulating compounds. However, some derivative or analogs 
Having lost their biological function may still find utility, for example for raising 
antibodies. These antibodies could be used for detection or purification purposes. 
10 In addition, these antibodies could also act as competitive or non-competitive 
inhibitor and be found to be modulators of the biological activity of the repeat 
domain. 

A host cell or indicator cell has been Iransfected" by 
exogenous or heterologous DNA (e.g. a DNA construct) when such DMA has 
15 been introduced Inside the cell. The transfecting DNA may or may not be 
integrated (covalently linked) into chromosomal DNA making up the genome of 
the cell. In prokaryotes, yeast, and mammalian cells for example, thetransfecting 
DNA may be maintained on a episomal element such as a plasmid. With respect 
to eukaryotic cells, a stably transfected cell is one in which the transfecting DNA 
20 has become integrated into a chromosome so that it is inherited by daughter cells 
through chromosome replication. This stability is demonstrated by the ability of 
the eukaryotic cell to establish cell lines or clones comprised of a population of 
daughter cells containing the transfecting DNA. Transfection methods are well 
known in the art (Sambrook et al.. 1989, supra; Ausubel et aL. 1994 supra). 
25 In general, techniques for preparing antibodies (including 

monoclonal antibodies and hybridomas) and for detecting antigens using 
antibodies are well known in the art (Campbell, 1984, In "Monoclonal Antibody 
T chnology: Laboratory Techniques in Biochemistry and Molecular Biology". 
Elsevier Science Publisher, Amsterdam, The Netheriands) and in Hariow et al.. 
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1988 (in: Antibody- A Laboratory Manual, CSH Laboratories). The present 
invention also provides polyclonal, monoclonal antibodies, or humanized versions 
thereof, chimeric antibodies and the like which are specific to the repeat domains 
of the present invention. 
5 From the specification and appended claims, the term 

therapeutic agent should be taken in a broad sense so as to also include a 
combination of at least two such therapeutic agents. Further, the therapeutic 
agent according to the present invention can be introduced into individuals in a 
number of ways. The therapeutic agent can also be delivered through a vehicle 
1 0 such as a liposome, which can be designed to be targeted to a specific cell type, 
and engineered to be administered through different routes. Having shown that 
a polyalanine segment is toxic to cells, the present invention provides the means 
to trigger toxicity in cells by expressing thereinto or delivering thereto a 
polyalanine-containing protein (or polyalanine-encoding nucleic acid). In 
1 5 accordance with known methods, a chosen cell population could be targeted. 

For administration to humans, the prescribing medical 
professional will ultimately determine the appropriate form and dosage for a given 
patient, and this can be expected to vary according to the chosen therapeutic 
regimen (i.e. DNA construct, protein, cells), the response and condition of the 
20 patient as well as the severity of the disease. 

Composition within the scope of the present invention should 
contain the active agent (i.e. fusion protein, nucleic acid, and molecule) in an 
amount effective to achieve the desired therapeutic effect while avoiding adverse 
side effects. Typically, the nucleic acids, fusion proteins and molecules in 
25 accordance with the present invention can be administered to mammals (i.e. 
humans) in doses ranging from 0.005 to 1 mg per kg of body weight per day of 
the mammal which is treated. Pharmaceutically acceptable preparations and 
salts of the active agent are within the scope of the present invention and ar well 
known in the art (Remington's Pharmaceutical Science. 16th Ed.. Mack Ed.). For 
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the administration of polypeptides, antagonists, agonists and the like, the amount 
administered should be chosen so as to avoid adverse side effects. The dosage 
will be adapted by the ciinician in accordance with conventional factors such as 
the extent of the disease and different parameters from the patient. Typically, 
5 0.001 to 50 mg/kg/day will be administered to the mammal. 

The present invention also relates to a kit for diagnosing a 
disease or condition associated with the expression of a repeat domain, encoding 
for example a polyalanine stretch, or a predisposition to contracting same 
comprising a nucleic acid, a protein or a iigand in accordance with the present 

10 invention. For example, a compartmentalized kit in accordance with the present 
invention includes any kit in which reagents are contained in separate containers. 
Such containers include small glass containers, plastic containers or strips of 
plastic or paper. Such containers allow the efficient transfer of reagents from one 
compartment to another compartment such that the samples and reagents are not 

1 5 cross-contaminated and the agents or solutions of each container can be added 
in a quantitative fashion from one compartment to another. Such containers will 
include a container which will accept the test sample (DNA protein or cells), a 
container which contains the primers used in the assay, containers which contain 
enzymes, containers which contain wash reagents, and containers which contain 

20 the reagents used to detect the extension products. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Having thus generally described the invention, reference will 
now be made to the accompanying drawings, showing by way of illustration a 
preferred embodiment thereof, and in which: 
25 Figure 1 shows the Western blot analysis of lymphoblastoid 

ceils from controls and M JD patients, a, Schematic representation of the MJD- 
Ala protein that results from a frameshift in the CAG tract showing the new C- 
tenminus (italicized; used to raise the FS1 and FS2 antibodies). West m blots of 
two control lymphoblastoid cell lines (cLCL) and four MJD lymphoblastoid cell 
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lines (MJDLCL) immunoprobed with FS1 (b), anti-ataxin-3 (c). 1C2 (d) and FS1 
pre-immune senjm (e). Arrow indicates tlie threshold between stacking and 
resolving portions of the gel. Panels b-d represent serial probing of a single 
membrane. 

5 Figure 2 shows the immunocytochemical detection of 

intranuclear deposits in lymphoblastoid cells. Immunocytochemistry of control 
LCL versus MJD LCL: absence of INI in control LCL probed with FS1 (a), and 
antl-ubiquitin (c), and detection of INI in MJD LCL with FS1 (b) and antiiJbiquitin 
(d). e, Immune detection of MJD LCL with FS1 pre-immune serum. For all 
10 panels, the magnification before publication is 400x (left) and 1000x (right). 
These results have been replicated in three separate experiments. 

Figure 3 shows the immunohistochemical detection of INI in 
MJD pontine neurons. Immunoprobing with FS1 antiserum in MJD pons (a) and 
control pons (b); immunoprobing with anti-ubiquitin in MJD pons (c) and control 
15 pons(d). INI in pontine neurons are indicated by anrowheads. Double labeling 
immunofluorescence analysis of MJD pons showing ubiquitin-labeled INI (e, h) 
and FS1 -labeled INI (f, i), and the composite image of bothlabelings (g. j). For all 
panels, the magnification before publication is lOOOx, before reproduction. 

Figure 4 shows the constructs used in the transfection 
20 experiments. All constructs, with the exception of pM JD1 1 . represent full-length 
MJD-1, Solid black boxes indicate the repeat portion of the constructs. 
Staggered ends indicate that EGFP will only be expressed if aframeshift occurs. 
Encircled blown up detail of pMJDI is also present in pMJD2 and pMJDS; details 
of pMJD5 are present in pMJD6; details of pMJD7 are present in pMJD8 and 
25 details of pMJD9 are the same as pMJDIO. 

Figure 5 shows the transfection experiments with different 
M JD/EGFP constmcts. a, DNA sequence of the clones with EGFP out of frame 
(pMJD1, pMJD2 and pMJDS) and b, with EGFP inglutamine frame (pMJD4), both 
(a, b) showing the predicted amino acid sequence, c-e, Fluorescence at 72 hours 
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of COS-7 cells transfected with pMJD1 (c). pMJD2 (d) and pM JD3 (e) where the 
MJD{CAG)r/EGFP fusion protein is translated only when frameshifts to GCA- 
polyAla occur f . COS-7 cells transfected with pMJD4 where EGFP is in frame 
with CAG-Gln. g, COS-7 cells transfected with the vector pEGFP-N1. h, 
5 perinuclear fluorescent aggregates observed in cells transfected with pM JD3. 
Pictures of sections c. d. e, f and g were taken at 25 x magnification for a fixed 
exposure time of 60 seconds before reproduction; picture shown in h, is at 400 
X before reproduction, i-k, Westem blots of protein isolated from cells shown in 
c, d. e, f, g and from mock-transfected cells immunoprobed with 1C2 (i), anti- 
10 ataxin-3 (j) and anti-HA (k). Arrows on the right of panel j indicate the proteins 
detected. Arrowhead on panel k indicates threshold between stacking and 
resolving portions of the gel. Rpts stands for repeats. 

Figure 6 shows the time-course immunocytochemical analysis 
of COS-7 cells transfected with constructs encoding ataxin-3 with either apolyAla 
15 or a polyGIn tract. Immunoprobing with anti-HA antibody at time-points 8 hours: 
(a) pMJD7. (b) pMJD9, (c) pMJD8, (d) pMJDIO and (e) vector pEGFP-NI alone; 
20 hours: (f) pMJD7, (g) pMJD9. (h) pMJDS, pMJDIO and (j) vector pEGFP-NI 
alone; 24 hours: (k) pMJD7, (I) pMJD9, (m) pMJDS. (n) pMJDIO and (o) vector 
pEGFP-NI alone; 48 hours: (p) pMJD7, (q) pMJD9. (r) pMJD8, (s) pMJDIO and 
20 (t) vector pEGFP-N1 alone. For all panels pictures were taken at lOOOx 
magnification, before reproduction. 

Figure 7 shows the westem analysis of transfected COS-7 
cells (in figure 5). Blots were probed with (a) anti-HA, (b) 1C2. Arrow indicates 
threshold between stacking and resolving portions of the gel. Cells were 
25 harvested at 72 hours after transfection. 

Figure 8 shows the immunocytochemical analysis of COS-7 
cells transfected with a truncated polyAla-encoding construct. Immunoprobing 
with anti-HA antibody in: (a) cells transfected with 42A and (b) mocWransfected 
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cells. For all panels pictures were taken at 1000x magnification, before 
reproduction. 

Other objects, advantages and features of the present 
invention will become more apparent upon reading of the following non-restrictive 
5 description of preferred embodiments with reference to the accompanying 
drawing which is exemplary and should not be interpreted as limiting the scope 
of the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The data herein presented strongly suggest the fact that (1) 
10 frameshifts occur within CAG repeats and are responsible for the production of 
alanine-containing proteins, and (2) these proteins accumulate as toxic 
aggregates. Detection of the hypothetical peptide in lymphoblastoid and neuronal 
cells of M JD patients and not in controls, and the production of green 
fluorescence preferentially in cells transfected with MJD-I bearing large CAG 
15 tracts with an out of frame EGFP, supports the occurrence of rareframeshifts 
during transcription and/or translation of long CAG tracts resulting from the use 
of the alternative GCA/Ala reading frame. The relatively small proportion of 
frameshifted product may explain the absence of loss-of-function of the protein 
and the relatively late-onset of these diseases. This model of slow accumulation, 
20 due to relatively rare frameshift events, better explains the late-onset nature of 
these diseases given that, in light of the high expression of these proteins in 
affected brain regions, a glutamine toxicity model would be expected to result in 
much eariier cell death and disease onset^^'^. 

Slippage into the third possible frame, AGC/Ser, may also be 
25 occurring. However, the absence of diseases associated with tracts ofpolyserine 
and the physical nature of alanine polymers, resulted in an exclusive focus on the 
GCA/Ala frame. While the relative frequencies of different frameshifts remain 
unknown, the fact that mosttranslational frameshift errors cause a +2 shift in the 
frame^ suggests that GCA/Ala may be the more frequent of the mutant species. 
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The present invention also assessed whether polyAia tracts 
are toxic and may be the initiating event in the formation of the INI seen in 
expanded CAG tract diseases. The evidence presented herein shows that in 
cells transfected with the CAG/3ln constructs, frameshifting into the alanine frame 
5 is progressive and frameshifted products are slowly accumulating in the nucleus 
as INI (see Fig. 5). Given the expected low frequency of firameshifts, the finding 
of frameshifted protein in INI as early as 12 hours aftertransfection argues that 
poiyAla accumulation is a very early event in the formation of these structures. In 
fact, in cells transfected with as few as 14CAGs. frameshifts are occurring and 
10 polyalanine-containing protein is accumulating in the nucleus as INI. This finding 
is consistent with a recent report by Perez et al. showing that transfection of 
normal sized CAG repeats leads to INP. In contrast with the slow, progressive 
accumulation detected for the CAGASIn constructs, transfection with the GCA/Ala 
constructs results in eariy and rapid accumulation of alanine-containing product. 
15 In this case the cells seem unable to cope with the presence of the toxic product 
resulting in an earlier and much more severe phenotype. The pattem of 
expression of polyAla products, consisting of one major juxtanuclear aggregate 
per cell and several smaller inclusions, is similar to that found inaggresomes. 
structures that form when the capacity of the proteasome degradation pathway 
20 is exceeded^V This seems to indicate that the direct expression ofpolyalanine 
in cells is likely to be extremely toxic. The presence of more "classical" INI in cells 
expressing CAG/GIn constructs is thought to result from rareframeshift events 
that produce alanine-containing protein, which slowly accumulates in the nucleus 
as aggregates. This model therefore better depicts what may be happening in 
25 diseased tissue cells. 

The proposed model of toxic aggregates resulting from 
frameshifts into the alanine frame is consistent with previous experiments 
performed with MJD, as well as other exp-CAG diseas s. For example, in 
Huntington disease (HD) and MJD, INI and protein accumulation are most clearty 
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detected using antibodies against epitopes to the N-temiinus side of the CAG 
repeat^*^, likely fc>ecause frameshifting will taincate the protein shortly after the 
repeat. This model could also explain the correlation observed in all CAG triplet 
diseases between the size of the repeat and the severity of the symptoms^, it is 
5 suggested that if the frameshift errors occur randomly, the longer the repeat the 
more frequently such errors would arise. This is supported by the observations 
that: (1) no accumulation was yet detected in the lymphoblasts of a MJD patient 
with the shortest (CAG)67 mutation; (2) fluorescence increased with the size of the 
CAG repeat in the transfection experiments; and (3) at any time-point, more than 
10 twice as many cells have inclusions containing frameshifted species when 
transfected with a (CAG)82 construct than with a (CAG) 15 construct. 

Another puzzling finding previously reported by other groups, 
and which can be explained by the model of frameshifts into the altemate alanine 
frame and the production of prematurely truncated proteins herein proposed, is 
1 5 the presence of short deavage products in inclusions in HO and spinal and bulbar 
muscular atrophy (SBMA)^^^. These short products seem to be too small to be 
the result of caspase-3 cleavage, but are of a size consistent with premature 
termination of transcription or translation due to the generation of an eariy STOP 
codon in the alanine shifted frame. 
20 The model presented herein might also explain the 

observation that while both CAA and CAG codons encode glutamine, only 
uninterrupted CAG tracts cause disease. In spinocerebellar ataxia type 1 (SCA1), 
for polyglutamine tracts of similar length the presence of a CAAcodon interrupting 
a CAG tract is used to differentiate a normal from a disease causing allele^®* 
25 These observations, while difficult to explain if polyglutamine is toxic, are 
predicted by the proposed polyAla nuclear toxicity model of the present invention, 
where interruption of a repeat sequence may lead to more stable transcription or 
translation. 
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Finally, the present invention may predict a similar pathogenic 
mechanism in the other diseases caused by expanded CAG tracts. It has been 
shown here that constructs containing almost exclusively a GCA tract are toxic 
to cells and lead to the formation of aggregates. This seems to indicate that the 
5 full MJD protein is not necessary to obtain a disease phenotype and that the 
presence of an expanded CAG repeat within any protein may be sufficient to 
cause disease. This would support the contention that the protein or gene 
causing the disease is indeed irrelevant to the disease process (Hardy et al. 
1998). In addition, it is possible thatpolyAla accumulation may play a role in 

10 aging of certain cell types. For example, if frameshifts causing polyAla tracts 
occur with all CAG repeat-containing genes even in the "nonnal range" as is seen 
in the 14 CAG- and 37 CAG-containing constmcts of the present invention, very 
slow accumulation of polyAla polymers may occur, leading to cell death. This 
could explain the observation that fewer large repeats are found in healthy elders 

15 compared to younger controls ^. The discovery of expanded polyalanine 
domains in three diseases^ ^-^^^ and the fact that CAG repeat frameshift mutant 
proteins can cause disease if they code for polyAla, pinpoints this homopolymer 
as a potential target for drug design. Polyalanine nuclear toxicity may well be a 
frequent cause of premature cell death in different tissues. 

20 The present invention is illustrated in further detail by the 

following non-limiting examples. 

EXAMPLE 1 

Frameshifts occur in CAG tracts resulting in poiyalanine- 
containing proteins 

25 Two immunopurified polyclonal antisera were raised against 

a synthetic peptide corresponding to the 12 last amino acids predicted to result 
from poly^a tract produdng frameshifts within the CAG repeat of theAf JO-t gene 
(FS1 and FS2)(Fig. la). This new amino acid sequence has no homology to any 
known protein. Both antisera detected high molecular weight aggregates in the 
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Stacking gels in Western blots of total lymphoblast protein from three MJD cases 
(Fig. 1b). In the two controls and in the MJD patient with the shortest (CAOs? 
repeat no aggregates were observed (Fig. 1b). Both FS antibodies also 
recognize, in all samples, an 83kD protein of unknown origin. In order to test if the 
5 signal detected in the stacking gels was the putative MJD-Ala protein, the same 
blots were probed with antisera raised against MJD protein epitopes (anti-ataxin- 
3; Fig. 1c)^^ and against polyGln domains (1C2; Fig. 1d)^^ These results are 
compatible with the presence of the predicted hybrid protein in the stacking gel. 
An anti-ubiquitin polyclonal antibody (data not shown) also detected the 
10 accumulations, which is consistent with previous reports^- ^. These 

accumulations are compatible with those reported by Paulson et al. who studied 
cells expressing mutant CAG tract expanded MJD-I using an antibody raised 
against the expressed ataxin-3 fusion proteinsV 

To test if the MJD-Ala protein accumulates in nuclei, 
1 5 immunocytochemistry on lymphoblasts from four controls and three MJD patients 
was performed. As Fig. 2 depicts. FS1 positive INI are observed in MJD cell lines 
and not In controls (Fig. 2a. b). Similar to the Western blot results, the anti- 
ubiquitin antibody (Fig. 2c. d) and the anti-ataxin-3 antibody (data not shown) also 
detect INI in MJD cell lines. 
20 In an attempt to show the presence of the frameshifted 

species in affected MJD brain regions, immunohistochemical FS1 and anti- 
ubiquitin staining in diseased and control pons, a region known to be affected in 
MJD was performed (Fig. 3). Both antibodies stain intranuclear structures in 
neurons of this MJD brain area (Fig. 3a. c). whereas pontine neurons in control 
25 brain have no INI (Fig. 3b, d). In immunofluorescence studies, it was shown that 
the frameshifted product colocalizes with ubiquitin in INI of pontine neurons (Fig. 
3e, f. g. h, i and j), suggesting that the intranuclear structures detected by both 
antibodies are the same. 
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EXAMPLE 2 

Frameshifts into alanine frame occur preferentially 
with expanded CAG tracts 

To test if frameshifts producing GCA/polyAla occur 
5 preferentially within expanded CAG tracts, an in vitro system to examine the effect 
of CAG repeat length on the frequency of frameshifts was designed. COS-7 cells 
were transfected with constructs bearing the full length WJD-t sequence with 
either a (CAG),4, {CAG)^y or (CAG)b2 repeat fused out of frame to theEGFP gene 
and driven by the CMV promoter (Fig. 4 and Fig. 5a). Whileframeshifts would 
10 occur with all CAG tracts, it was hypothesized that they would occur more 
frequently within larger repeats. The pMJD1, pMJD2 and pMJD3 constructs 
would yield EGFP-containing fusion proteins only if a frameshift occurs and 
produces the GCA/polyAla reading frame and the MJD-Ala protein. At 72 hours, 
cells transfected with pMJD2 and, especially pM JDS. showed green fluorescence 
15 (Fig. 5c. d. e). Green fluorescence was observed within 24 houi^ in the positive 
control cells transfected with the pEGFP-N1 vector alone or pMJD4 construct, 
where the EGFP coding sequence was in the glutamine frame in MJD-I (Fig. 5f, 
g). At 96 hours, and at higher magnification, frequent EGFPpositive perinuclear 
inclusions were observed in cells transfected with the pMJD3 construct (82 
20 CAGs) but not in the constmct with 14 CAGs, and rarely in cellstransfected with 
the 37 CAG construct (Fig. 5h). These perinuclear inclusions are similar to those 
found in cell culture models of MJD. as well as other CAG tract disorders, such 
as Huntington disease^^. 

Western blots of protein extracted from the transfected cells 
25 were probed to confimn this interpretation (Fig. 5i. j). While 1C2 detects only the 
expanded MJD gene products bearing either 37 or 82 polyglutamine repeats in 
cells transfected with pMJD2 and pMJD3 respectively (Fig. 5i), anti-ataxin-3 
detects all three different size polyglutamine-tract containing gene products (Fig. 
5j), as expected. With both antibodies, protein accumulation in the wells for 
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pM JDS and pM JD4 was detected. In order to further determine the nature of the 
accumulated protein seen for pMJDS, the pMJDI and pMJDS constructs were 
modified by adding epitopes for FS1 and HA in the alanine frame (pMJDS and 
pMJD6, Fig. 4). Western blots of cells transfected with these constructs were 
5 probed with anti-HA. No signal for the 14 CAG-bearing pMJD5 construct was 
detected, but a band corresponding to aggregated protein was detected at the top 
of the gel for pMJD6, showing that with 82 repeats, frameshifts are occurring. The 
absence of a band corresponding to the predicted size of the ataxin-3/HA protein 
(Fig. 5K) indicates that all frameshifted protein is accumulating as insoluble 
10 aggregates. These experiments further demonstrate thatframeshrfts do occur 
and that their frequency increases with the size of the CAG repeat. 

It shall be recognized by the skilled artisan that the in vitro 
system described above can be modified at will and still enable a determination 
of the frameshifting frequencies. Non-limiting examples of such modifications 
15 include the use of longer or shorter CAG-tracts, different reporter gene (i.e. 
luciferase) and different epitopes. In addition, such systems could be used to 
screen for drugs or compounds which modulate frameshifting and/or affect the 
level of INI formation and/or polyalanine-protein aggregation and/or cell toxicity. 

It should also be recognized that the in vivo methods shown 
20 above (and others) could also be used to screen drugs or compounds which can 
modulate the level of polyalanine formation, INI formation and the like. In 
addition, these in vivo methods (and others) could be used to validate the effect 
of compounds or drugs identified in an in vitro assay. 
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EXAMPLE 3 
Polyalanine-containing proteins are present in 
aggregates and are toxic to transfected COS-7 cells 
5 in a time-dependant manner 

It has been shown thatframeshifting into the alternate alanine 
frame occurs both in vitro and in vivo. To detennine if these products are toxic 
or are simply harmless byproducts with no real consequences to the cell, a new 
set of MJD'1 constructs where the reading frame immediately before the CAG 
10 repeat was mutated to code for a polyalanine stretch was designed. These 
constructs, pMJD9 and pMJDIO, contained stretches of 14 GCA repeats and 82 
GCA repeats respectively, and were fused in frame to the EGFP gene (Fig. 4). A 
HA tag was also added at the COOH-terminus in frame with the GCA stretch. 
COS-7 cells were transfected with these constructs, and with pMJD7 and pMJD8, 
15 which contained 14 CAG repeats and 82 CAG repeats respectively, as well as 
EGFP fused in frame with the CAG tracts. In addition, a HAepitope was also 
added at the COOH-terminus in frame with the GCA tracts, and so only 
detectable if frameshifts occurred (Fig. 4). 

In a time-course experiment using anti-HA antibody as a 
20 probe to detect only protein frameshifted to the alanine frame, cells were collected 
and immunostained cells at 8. 12, 16, 20, 24. 48 and 72 hours. Cellstransfected 
with the CAG/GIn constructs pMJD7 and pMJD8 showed faint background 
staining at 8 hours (Fig. 6a, c). Positive signal was detected for these two 
constructs at 12 hours (not shown) in the form of intranuclear inclusions (typically 
25 one or two per nucleus). At 20 hours, nuclei of cellstransfected with pMJD7 and 
pMJD8 contained inclusions, but were morphologically normal (Fig. 6f, h). At 24 
hours, cells transfected with the shorter construct remained morphologically 
normal (Fig. 6k), but cellstransfected with the construct bearing 82 CAG repeats 
started showing some perinuclear and cytoplasmic inclusions in addition to the 
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intranuclear aggregates (Fig. 6m). At tinis time-point about 85% oftransfected 
cells have inclusions when transfected with pMJDS, whereas only 40% of 
transfected cells show inclusions when transfected with pMJD7, suggesting that 
frameshifts occur with both constructs, but more frequently for the longer repeat. 
5 Nuclei of cells stained at 48 hours showed some membrane disintegration with 
both constructs, but cells containing the longer CAG repeat also had perinuclear 
and cytoplasmic inclusions (Fig. 6p, r). indicating a more severe phenotype. It is 
useful to reiterate that probing cells transfected with the CAGGIn constructs with 
anti*HA will only detect frameshifled protein. These species are detected early 

10 in the transfection and only as INI. suggesting that, despite the probable rarity of 
frameshifts, they are producing proteins that accumulate as highly insoluble INI. 

Inclusions for both GCA/Ala constructs (14A and 82A) as 
early as 8 hours after transfection were detected (Fig. 6b, d). Typically, cells have 
one major perinuclear or cytoplasmic inclusion and abnormal nuclear 

15 morphology. The cellular phenotype progresses rapidly with time in cells 
transfected with either pMJD9 (Fig. 6g, I, and q) or pMJDIO (Fig. 6i, n, s), and 
was extremely severe when compared with the CAG/GIn counterparts of the 
same repeat size (for example: compare panels m and n in Fig. 6). Cells 
transfected with the GCA/Ala constructs showed abnormal nuclear structure and 

20 aggregate formation mainly in the cytoplasm, usually with one majorjuxtanuclear 
inclusion and what appears to be cytoskeletal reorganization. Similar results 
were obtained in cells transfected in parallel with the same constructs but probed 
with FS1 antiserum (results not shown). At all time-points cells transfected with 
the pEGFP*N1 vector alone showed only background staining and were devoid 

25 of inclusions (Fig. 6e, j, o, t). 

Western blots of protein extracted from the transfected cells 
were performed to investigate the nature of the inclusions found. Probing the 
blots with anti-HA det cted a signal for both GCA/Ala constructs (pMJD9 and 
pMJDIO), where the HA tag is in the main reading frame (figure 7a). No signal 
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was detected with equivalent exposure for the CAG/GIn constructs, but 
overexposure of the blots revealed the presence of smears and protein in the 
wells for these constructs as well. In addition, while discrete bands are resolved 
with pMJDS, signal is only seen in the stacking gel with pMJDIO, suggesting that 

5 the larger polyAla-containing proteins are very insoluble and do not migrate into 
the gel at all. Probing the same blots with 1C2 confirms the presence ofpolyGln- 
containing proteins in the cells transfected with pM JDS, and the absence of such 
proteins in cells transfected with frameshifted constructs (figure 7b). The 
presence of signal in the stacking gel for pMJDS may result from the presence of 

10 enough Gin residues to allow detection by the 1C2 antibody. It could also be due 
to the recruitment of the intact non-frameshifted protein into the insoluble 
aggregates, recruitment of hybrid polyAla/polyGIn protein, or polyGIn protein 
accumulation independent of polyAta polymers. 

EXAMPLE 4 

1 5 PolyGCA/poiyAla stretches alone are toxic and sufficient 

for formation of aggregates 

In order to determine whether a construct with only a GCA 
tract encoding an alanine peptide, outside the context of the ataxin-3 protein, 
would be sufficient to produce a cellular phenotype, a truncated (GC^42 construct 
20 with FS1 and HA epitopes in frame with GCA was transfected into COS-7 cells 
(pMJD1 1 , Fig. 4). In this construct, the MJD-1 sequence was truncated so that 
the resulting protein will only have 25 amino acid residues left upstream from the 
repeat, and only the FS1 and HA epitopes after the GCA tract. At 24 hours, cells 
showed perinuclear and cytoplasmic aggregates and an abnormal nuclear 
25 morphology (Fig. 8a), a phenotype that is very similar to the one obtained by 
transfection of the full-length GCA/Ala constructs described above. Mock- 
transfected cells showed background staining and absence of inclusions (Fig. 
8b). These findings indicate that the presence of a frameshifted CAG rep at 
product is sufficient for toxicity, independent of the protein context, and sugg st 
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a similar pathogenic mechanism may be operating in the other diseases caused 
by expanded CAG tracts. This finding is thus of relevance to expand the 
teachings of the present invention to all diseases or conditions caused by 
expanded CAG tracts and to diseases and conditions caused by the accumulation 
5 of polyalanine-containing proteins. 

EXAMPLE 5 
Antisera production 

A 12-mer peptide corresponding to the new predicted COOH- 
10 terminus of the ataxin-3 protein after frameshift occurs (AAGPIRTEFTSM) was 
used to raise antisera from two rabbits, FS1 and FS2. Injections were perfonned 
using 0.5 mg of peptide conjugated to KLH and sera were collected using 
standard protocols^^. 

EXAMPLE 6 
Western Blots 

For protein extraction, MJD and control lymphoblastoid cells 
were lysed in buffer containing NP-40. Equal amounts of protein were 
electrophoresed in 8% SDS-polyacrylamide gels and transblotted to nitrocellulose 
membranes (as commonly known). Immunodetection was performed using FS1 
(1:300), FS2 (1:300), anti-ataxin-3 (1:1000). 1C2 (1:2000). anti-ubiquitin (1:400; 
DAKO) and FS1 and FS2 pre-immune serum (1:300). Results for FS1 and FS2 
were always consistent; all experiments were repeated three times on different 
blots. HRP conjugated secondary antibodies were used at a 1:10,000 dilution. 
COS-7 cells transfected with various constructs were collected, washed and lysed 
in Sample Loading Buffer. 100 pg of each sample was used to run on a 10% 
SDS-PAGE and transblotted onto nitrocellulose membranes. Immunodetection 
was performed using antisera at following dilutions: 1C2 (1:5000). anti-ataxin-3 
(1:2000) and anti-HA (1:1000). Results were visualized by chemilumin scence 
(RENAISSANCE). 



15 



20 



25 
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EXAMPLE 7 
Immunocytochemistry 

MJD and control lymphoblastoid cells were harvested and a 
total of 50x10^ cells from each cell line were plated ontopoly-D-lysine coated 

5 slides and fixed with acetone/methanol (1:1). Immunodetection was performed 
using FS1 (1:300). anti-ubiquitin (1:300), anti-ataxin-3 (1:500) and FS1 pre- 
immune seaim (1:300). Biotinylated secondary antibodies were used at a 1:500 
dilution and an amplification step was performed using the ABC kit (VECTOR). 
Reaction product was visualized using the VIP kit (VECTOR). Immunocyto- 

10 chemistry on COS-7 cells was perfomied on cover slips. At each time-point cells 
were fixed with 4% paraformaldehyde and immunodetection was performed using 
anti-HA probe (1:500) and FS1 (1:300). Secondary antibodies and subsequent 
amplification and detection procedures were carried out as described above. 

EXAMPLE 8 

1 5 Immunohistochemistry 

For immunohistochemistry of brain sections, 5 \im sections of 
paraffin-embedded tissue from the pons of an MJD patient and a control subject 
were used. Sections were deparaffinized, permeabiiized and immunostained with 
FS1 (1:50) and anti-ubiquitin (DAKO) (1:300). Biotinylated secondary antibodies 

20 were used at a 1 :500 dilution and an amplification step was performed using the 
ABC kit (VECTOR). Reaction product was visualized using the VIP kit 
(VECTOR). For coimmunofluorescence of FS1 and anti-ubiquitin antibodies in 
brain sections, the same procedure for preparation of samples was followed. 
Immunodetection was performed using FS1 (1:50) and monoclonal anti-ubiquitin 

25 (ZYMED) (1 :300). A mixture of Cy3-conjugated anti-mouse antibody (1 :1 00) and 
fluorescein-conjugated anti-rabbit antibody (1:50) was used as secondary probe. 
Sections were mounted in SlowFade (Molecular Probes). 
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EXAMPLE 9 

Plasmid construction, transfection and cell culture 

DMA amplification was perfonned using Pfu DMA polymerase 
(STRATAGENE). Primer MJD-5': TTTTAAGCTTAGACAAA-TAAACATGGAG 
5 (SEQ ID NO:1) was used in conjunction with MJD-3': 
CCGGTGGATCCCTCATCCTGATAGGTCCCGCTGCTG (SEQ ID NO:2) for 
pMJDI, pMJD2 and pMJD3, or MJD-3'c: CCGGTGGATCCCTCA- 
TTGATAGGTCCCGCTGCTG (SEQ ID NO:3) for pMJD4 (STRATAGENE). 
Primer MJD-5XHIII): TTTAAGCTTCCCACCATGGAGT-CATCTTCCA (SEQ ID 

10 NO:4) was used in conjunction with MJD-3*(BI): 
CCGGTGGATCCCTCAGGGCGTAGTCGGGGACGTCGTAGGGGTACATGGAT 
GTGAACTCTGTCCTGATAGGTCCCGCTG (SEQ ID NO:5). for pMJDS and 
pMJD6. or MJD-3'c(Bl): CCGGTGGATCCCAGGGCGTAGTCGGGGACGTCG- 
TAGGGGTACATGGATGTGAACTCTGTCCTGATAGGTCCCGCTG (SEQ ID 

15 NO:6) for pMJD7 and pMJDS, or MJD-Ala: 
CGGAAGAGACGAGAAGCCTACTCCGGAAAAACAGCAGCAAAA-GCAGC 
(SEQ ID NO:7) for pMJD9, pMJDIO pMJD1 1 . Amplified products were digested 
with BamHI and Hindlll and cloned into plasmid pEGFP-NI (CLONTECH), except 
for pMJD1 1, for which the amplified product was cloned into a modified version 

20 of the pEGFP-NI plasmid lacking the EGFP gene. All constructs were confirmed 
by sequencing (as commonly known). COS-7 cells were seeded inDulbecco's 
modified Eaglets medium (DMEM) containing 10% fetal calf serum the day before 
transfection at 2 x ltf per well in 6-well plates containing sterile coverslips. COS- 
7 cells were transfected with plasmid DMA (2.0 pg) using lipofectamine reagent 

25 (GIBCO BRL) according to the manufacturer's instructions. For the experiment 
depicted in figure 4, after 72-96 hours, the cells were fixed with PBS/4% 
paraformaldehyde and obsen/ed under a fluorescent microscope with FITC filter 
in four independent experiments. 

CONCLUSION 
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The present invention thus shows that transcriptional or 
translational frameshifts occurring within expanded CAG tracts result in the 
production and accumulation of polyalanine-containing mutant proteins. These 
alanine polymers might deposit in cells forming INI and lead to nuclear toxicity. 
5 Support for this disease model is provided using lymphoblast cells from MJD 
patients, as well as in pontine neurons of MJD brain and in in vitro cell culture 
models of the disease. Evidence that alanine polymers alone are toxic to cells is 
also provided and strongly suggests that a similar pathogenic mechanism 
underties the other CAG repeat disorders. How these accumulations lead to cell 
1 0 death still needs to be elucidated. 

Although the present invention has been described 
hereinabove by way of preferred embodiments thereof it can be modified, without 
departing from the spirit and nature of the subject invention as defined in the 
appended claims. 
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WHAT IS CLAIMED IS: 

1 . A method for the diagnosis of a disease associated with 
protein accumulation in intranuclear inclusions in a cell of a patient, which 

5 comprises: 

a) obtaining a sample from said patient; and 

b) determining a presence of said protein accumulation in 
said intranuclear inclusions 

wherein said protein accumulation in said intranuclear inclusions is indicative of 
10 a disease associated therewith. 

2. The method of claim 1. wherein said protein is a 
polyalanine-containing protein. 

3. The method of claim 1 or 2, wherein said disease is a 
neurological disease. 

15 4. The method of claim 1 , 2 or 3, wherein said determining is 

carried out with one of a iigand and/or a nucleic acid sequence. 

5. A method for the screening of agents which can modulate 
at least one of: (a) polyamino acid stretch-containing protein expression; (b) 
accumulation of polyamino acid stretch-containing proteins in intranuclear 
20 inclusions; and (c) toxicity of polyamino acid stretch-containing proteins to cells, 
which comprises: 

a) incubating a cell which expresses a polyamino acid stretch-containing 
protein, associated with a disease or condition in an animal, with a 
compound; and 
25 b) assessing one of a) to c); 

whereby a modulator is selected when said agent significantly modulates one of 
said expression, accumulation and toxicity, as compared to a control ag nt. 
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6. The method of claim 5, wherein said polyamino add 
stretch-containing protein is a poiyaianine stretch-containing protein. 



10 



7. The method of claim 5 or 6, wherein said polyamino acid 
stretch-containing protein is expressed by an expression vector which comprises 
a repeat domain. 

8. The method of claim 7, wherein said polyamino acid 
stretch-containing protein is a poiyaianine stretch-containing protein. 

9. The method of claim 8. wherein said poiyaianine stretch is 
encoded by a CAG repeat. 



10. The method of claim 9, wherein said CAG repeat is an 
1 5 uninterrupted CAG tract. 



11. The method of claim 5. wherein said cell is selected from 
a lymphoblast cell from a Machado- Joseph disease (MJD) patient, a pontine 
neuron of MJD brain and an in vitro cell culture model of a neurological disease 

20 associated with said polyamino acid stretch-containing protein. 

12. A method to trigger toxicity in a cell comprising an 
increased expression of an alanine polymer stretch in a protein. 



25 1 3. The method of claim 8, wherein said poiyaianine stretch is 

encoded by a GCG repeat. 

14. The method of claim 5, wherein said cell is isolated from 
an oculopharyngeal muscular dystrophy (OPMD) patient. 
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15. The method of claim 13 or 14, wherein said GCG repeat 
is present in the PABP2 gene. 
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SEQUENCE LISTING 

SEQ ID NO:1 TTTTAAGCTTAGACA/VATAAACATGGAG 

SEQ ID NO:2 CCGGTGGATCCCTCATCCTGATAGGTCCCGCTGCTG 

5 SEQ ID NO:3 CCGGTGGATCCCTCATTGATAGGTCCCGCTGCTG 

SEQ ID NO:4 TTTAAGCTTCCCACCATGGAGTCATCTTCCA 

SEQ ID NO:5 CCGGTGGATCCCTCAGGGCGTAGTCGGGGACGTCG- 

TAGGGGTACATGGATGTG/^CTCTGTCCTGATAGGTCCCG 
CTG 

10 SEQIDNO:6: CCGGTGGATCCCAGGGCGTAGTCGGGGACGTCGTAGGG- 

GTACATGGATGTGAACTCTGTCCTGATAGGTCCCGCTG 

SEQ ID NO:7 CGG/^GAGACGAQAAGCCTACTCCGGA/W^CAGCAGCA- 
A/\AGCAGC 
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□ This report Is also accompanied by ANNEXES, I.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 
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Form PCT/I PEA/409 (cover sheet) (January 1994) 



ONTERNATOONAL PRELllllNARY 

EXAMINATDON REPORT International application No. PCT/CAOO/01052 



I. Basis of the report 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office In response to an Invitation under Article 14 are referred to In this report as ^'originally filed" 
and are not annexed to this report since they do not contain amendments (Rules 70. 16 and 70. 1 7) ): 
Description, pages: 

1 -44 as originally filed 

Claims, No.: 

1-15 as originally filed 

Drawings, sheets: 

1/8-8/8 as originally filed 

Sequence listing part of the description, pages: 

1 -3, as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this Item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

H contained in the intemational application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

B furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure In 
the intemational application as filed has been furnished. 

\S The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 
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□ 
□ 
□ 



the description, 
the claims, 
the drawings, 



sheets: 



pages: 
Nos.: 



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing suet) amendments must be referred to under item 1 and annexed to this 
report,) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non- 
obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 
H claims Nos. 1-4. 



^ the said international application, or the said claims Nos. 1-4 with respect to industrial applicability relate to 
the following subject matter which does not require an international preliminary examination {specify): 
see separate sheet 



□ the description, claims or drawings {indicate particular elements belov^ or said claims Nos. are so unclear 
that no meaningful opinion could be formed {specif^: 



□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion 
could be formed. 

□ no International search report has been established for the said claims Nos. . 

2. A meaningful international preliminary examination cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 



□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



because: 
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1. Statement 



Novelty (N) 



Yes: 
No: 



Claims 
Claims 



9-10, 12, 14 
1-8, 11, 13, 15 



Inventive step (IS) 



Yes: 
No: 



Claims 
Claims 



1-15 



Industrial applicability (lA) Yes: Claims 5-15 

No: Claims 

2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1. Certain published documents (Rule 70.10) 

and / or 

2. Non-written disclosures (Rule 70.9) 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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Rett mill 

Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 

Claims 1-4 relate to subject-matter considered by this Authority to be covered by 
the provisions of Rule 67.1(iv) PCT. Consequently, no opinion will be fonnulated 
with respect to the industrial applicability of the subject-matter of these claims 
(Article 34(4)(a)(i) PCT). See point V-4 below. 

Re Item V 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

1 - Reference is made to the following documents : 

D1 : WO 99 29896 A (Brais B, Univ. McGill), 17 June 1999, cited in the 
application 

D2: Brals B et al. : 'Short GCG expansions in the PABP2 gene cause 

oculopharyngeal muscular dystrophy' Nature Genetics, vol. 18, no. 2, 18 
February 1 998, pages 1 64-1 67, cited in the application 

D3: Ordway JM et al. : 'Ectopically expressed CAG repeats cause intranuclear 
inclusions and a progressive late onset neurological phenotype in the 
mouse.' Cell, vol. 91, no. 6, 12 December 1997, pages 753-763, cited in the 
application 

D4: Caspar C et al. : 'CAG tract of MJD-1 may be prone to frameshlfts causing 
polyalanine accumulation.' Hum. Mol. Genet., vol. 9, no. 13, 2000, pages 
1957-1966 



2 - Novelty - Art. 33(1) and (2) PCT : 

2.1 Document D1 discloses that expanded (GCG) repeat in the poly(A) binding 

protein II (PAB II) gene, encoding a polyalanine tract located at the N-terminus of 
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the protein is associated with oculopharyngeal muscular dystrophy (OPMD) (p. l , 
lines 21-23), and suggests that pathological expansions of the polyalanine (polyA) 
tract may cause mutated PAB II oligomers to accumulate as filament inclusions In 
the nuclei (p. 1 , lines 30-32). Said gene is therefore contemplated as a tool for the 
diagnosis and treatment of a disease related with polyA accumulation in nucleus 
such as OPMD (p. 2, lines 3-6). Indeed, document D1 describes a method for the 
diagnosis of a disease with protein accumulation In nucleus which comprises the 
steps of : (a) obtaining a nucleic acid sample of said patient ; (b) determining 
allelic variants of GCG repeat of the PAB II gene wherein long allelic variants are 
indicative of a protein accumulation in the nucleus, such as polyA accumulation 
(p. 2, lines 19-29 ; claims 6-7 ). Moreover, document D1 reports on a method for 
the screening of therapeutic agents for the prevention and/or treatment of OPMD 
that comprises the steps of (a) generating a non-human model for the PAB II gene 
whose germ and somatic cells are modified to express at least one allelic variant 
of GCG repeats of the human PB II gene ; (b) administering said therapeutic 
agents to the non-human model ; (c) evaluating the prevention and/or treatment of 
development of OPMD in said mammal (p. 2, line 33 to p. 3, line 14 ; claim 10). 
Hence, document D1 appears to be novelty destroying for the subject-matter of 
claims 1-4. 5-8. 11. 13 and 15 . 

2.2 Document D2 discloses that nuclear filament Inclusions are the pathological 
hallmark of OPMD (Abstract, lines 5-6) and describes the screening for PAPB2 
mutations, i.e., GCG repeat expansions from patients diagnosed as having OPMD 
on clinical grounds using RT-PCR analyses (Abstract, lines 9-11 ; p. 166, col.1, 
last paragraph to col. 2, second paragraph). Document 2 thus provides evidence 
that pathological expansions of the polyA tract may cause mutated PABP2 
oligomers to accumulate as filament inclusions in nuclei (Abstract, 3 last lines). In 
light of document D2, the subject-matter of claims 1-4 can therefore not be 
regarded as novel. 

2.3 The available prior art documents disclose neither a method for the screening of 
agents according to claims 9-10 and 14, nor a method to trigger toxicity In a cell 
according to claim 12. Therefore, the subject-matter of claims 9-10. 12 and 14 can 
be considered as novel. 
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3 - Inventiv step - Art. 33(1) and (3) PCX : 

3.1 Document D3 reports that the mutations responsible for several human 
neurological degenerative disorders including Huntington's disease and 
spinocerebellar ataxia SCA3 (Machado-Joseph disease, MJD) are expansions of 
translated CAG repeats. Document D3 describes mutant mice that express a form 
of HPRT protein containing a long polygin repeat after insertion of a 146-unit CAG 
repeat into corresponding gene. Said mice develop a phenotype similar to the 
human translated CAG repeat disorders and show a late onset neurological 
phenotype that progresses to premature death (Abstract). CAG repeats are 
responsible for similar effects on the proteins they expand as those provoked by 
GCG repeats. Therefore, the skilled person would regard It a normal design 
procedure to combine all the features set out in claims 9-10, especially as the 
advantages thus achieved can readily be contemplated in advance. Thus, the 
subject-matter of claims 9-10 cannot be considered as involving an inventive step. 

3.2 Document D3 provides evidence that CAG repeats do not need to be located 
within one of the classic repeat disorder genes to have a neurotoxic effect 
(Abstract). In light of document D1 teaching that GCG repeats encoding polygin 
provoke related effects, it would be obvious to the person skilled in the art to 

contemplate applying the GCG repeats encoding polyala stretch to the method of 

triggering cell neurotoxicity disclosed in document D3, thereby arriving at the 
method according to claim 12. Consequently, the subject-matter of independent 
claim 12 cannot be regarded as involving an inventive step, 

3.3 In light of document 01 , the subject-matter of claim 14 cannot be regarded as 
involving an inventive step since it falls within the customary practice followed by 
one skilled in the art. 



4 - Industrial applicability - Art. 33(1) and (4) PCT : 

Due to the step of "obtaining a sample from a patient", the method according to 
claim 1 is considered as comprising a surgical step carried out on a living human 
or animal body. 
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For the assessment of the present claims 1-4 on the question whether they are 
industrially applicable, no unified criteria exist in the PCT Contracting States. The 
patentability can also be dependent upon the fomiulation of the claims. The EPO, 
for example, does not recognize as industrially applicable the subject-matter of 
claims to a method for treatment of the human body by therapy, or the use of a 
compound in medical treatment, but may allow, however, claims to a known com- 
pound for first use in medical treatment and the use of such a compound for the 
manufacture of a medicament for a new medical treatment. Hence, should the 
Applicant wish to enter the European Regional phase, he should ensure that 
"obtaining a sample from a patient" is not included as a distinct step in the method 
of claims 1 -4. 

5 - P-documents ; 

Document D4, was published after the priority date, but before the filing date of 
the present application and is therefore relevant only for those parts, if any, of the 
present application which do not have a valid claim to priority. 



Re Item VI 

Certain documents cited 

Certain published documents (Rule 70.10) 

Application No Publication date Filing date Priority date (valid claim) 

Patent No (day/month/year) (day/month/year) (day/month/year) 

WO 00/26675 1 1 .05.00 03.1 1 .99 03.1 1 .98 

Should the present application enter the national or regional phase, the above 
document could be relevant to the question of novelty. 
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Certain observations on the international application 

1 . Since a CAG codon encodes a glutamine amino acid, a polyalanlne stretch cannot 
be encoded by a CAG repeat. Therefore, the subject-matter of claim 9 laclcs clarity 
and has been examined as reading "said polyamino acid stretch" instead of "said 
polyalanine stretch" (Art. 6 PCT). 

2. For the sake of clarity, the reference of claim 15 to claim 14 should have been 
deleted since claim 14 does not concern any GCG repeat (Art. 6 PCT). 
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